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With the rapid deveiopni^^^ servicS 



throuj^ various PG: or^p|!|C dris kind of ubiquitouF 

multimedia services poJ|p^c*|£^ Since the current Interiiet ^ 

; h ^^?enepus ; i»^ ^bM^;,^a|^ wide range. This:ie<iuires::V^ 

icpdiiig^tet providejcp 
PC devices only hav^ 1^ 

techniques provide efficient video representation wito decoding: ico^le^ityl^ 

sttaightforward solution to meet the above requirements would be to compress the saine video se<^er^ into ra^l^ 
bitstrca^ to every and device coim>lex^. ^ 

bitstream to transmit to an individual liset ac<x>rding-^the actual cx>rmecUoh speed and device capability; Qbvio 
this would be a -great waste of system resource. On the other hand, even if we could manage to store all these bits ^ _ 
in the server, in a dynamically changing non-QoS guaranteed network, we still could not provide the best availabL 
video quality with a single bitstream. Therefore, an active research topic in video coding field is how to efficiently 
compress video: sequence with different scalabilities, such as rate, quality, temporal, spatial and complexity scalabilit^^ 

Spatial scalable coding can provide video at different resolutions to fit in a wide range of applications. Some rela^f 
techniques have been developed in the past years [1-7]. Recent video coding standards, such as H.263++ and MPEQ||f 
have also adopted spatial scalable coding techniques [8-9]. Fig. 1 illustrates a typical architecture of the traditi^!! 
spatial scalable coding scheme adopted in MPEG-4 standard. Generally speaking, in a traditional spatial scalable codif] 
scheme, there are two layers: low-resolution base layer and high-resolution enhancement layer. At the base layer^ 
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>olutions ? different computational power and different connection speeas^cannot-be sor^ c 
l^iht^lone:^^ fran^wprk^rti^ 

co mpresses a video sequence into three layers: one base layer* and^ two 
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resolulionl ^mejenhamtcement layers can still maintain good coding ^ efficiency : Tne basic idea ab^ 
^NRf i^alable'cai^g^w 

liframewbrid m ondef to further improve coding efficiency and reduce drifting errors at tfe 
j^^s^ fra^wbrk defines five coding modes with different forms in motion compemtioh and^rewnsmi . 
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enhancement layer is encoded with the same technique proposed in [i4]; |u on the discussion on i^to 

efficiently compress the HR enhancement layer. 
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Frame 2 • Frame 3 

Rg 2 : Tte proposed 

As illustrated in Fig. 2, the first frame of each GOP (Group of Picture)?is encoded as forward prediction frame at thel 
enhancement layer. The reference is derived from the current reconstructed base layer (at low resohitibii^if 
interpolation, wtere tte bihnear method is used for simplicity. In order to obtain the fine granularity scalability, thel 
plane coding technique is applied to compress the predicted DCT coefficients into an embedded bitstream insteadic^lL^ 
traditional ceding. The rest of frames in this GOP are predicted with two references. One is the low qualji^ 

reference constructed from the previous LR base layer after bilinear interpolation. Another is the high quality refereno^^ 
constructed from the HR enhancement layer. In fact, there are two methods for construction of the low-q^^ 
reference, from the base layer or from the LR enhancement layer. If the low quality reference is constructed from! 
LR enhancement layer, the base layer video is first improved at low resolution and then switch to high resolution^ 
would decrease the coding efficiency at HR video, because it is more efficient to allocate the bits used for>th# 
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3. FIVE !V^iSF^lVIACR6BI,OCK.BA^S) CODING AT THE mGH R^OLUTIQN 

oo^atto^can be Jiffe^jfrom^tbe prediction for tecbns^tion. •The>framework shown Fig. ,2 ican »be either 

^rt 3 ^ 1 ^ 5"? « at huicroplbck level. As suggested in [14], since macroblock-based c^ 

flSwv? ^ ma ^ ^ved , coding efficiehcy, we adopt the n^blo^ased approach ^S^S^* 

^^r^rf ° D T^^ P ^ * ^"P* ^ this s«% defies five SgnS£S 

shown iB , Fig, 4 for codmg macroblpcks at the ffll enhancement layer. In other words, each macroblock VM 
enteuK^ment layer can select the individual method for motion cornpensation and reconstrucS.^cooSg moS 
information . of eadh macroblock is encoded in the frame header by siinple VLC coding. ' m - 

^!?% * V ^S^S* ^^m^ and reconstructed both ^m the low qual|^^re^cuon >„plus u^e residual 
■WSm^ Since the bit^of Ae base layer is very low, it is reasonable to assume that the base Uyer bitstream c2 
a^ays^or^y ^^te clieg with ^^appropriate error protection, in oth^wprd^ iow^aify 
Rm, m m image s are always available at the decoder. Therefore, this mode does not cause any drifting 

gors. However, smce the prediction is derived from the LR base layer, the coding efficiency of this mode is usually 

to Mode 2, die HR image is compensated and reconstructed both from the HR prediction p Ht plus the residual image 
S " ^ghquality prediction is used at the HR enhancement layer, if all macroblodcs are encoded with this 

framework can achieve high coding efficiency as long as the high quality prediction is available at 
T ? "-IT ' .f mCC Mgh P^^ 00 contains several bit planes from the HR enhancement layer 

they may be dropped when channel conditions somehow deteriorate. In this case, the decoder has to use the corrupted 
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posed framework is expected to maintain high coding efficiency at high bit rates jana^reducert^ 



w "tasipossible at low bit rates by selecting appropriate cod^g ? mckle for each macxqblq^^As <fi 
jaii¥tnwirS^ enhancement layer. Thus, the criterion for selecting jM^i^on is jpyen as 
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^^^^^|;|is; the original HR video. When the .HR enhancement layer is compensated fe^ ^vIqwmi^^ 
fmu^me residual image, the predicted error image encoded is trie first item in criterion {l)J WKelti it is cbrnpensated from 
^^^j3iiquahty: prediction plus the residual image, the predicted error image encoded is tre v se<»na;item 

*||^iq^nsated ftpni the high quality prediction, the predicted error image encoded is: u^Un^ 
fn^iw-esti^tion, this criterion utilizes the sum of absolute difference (SAD) to measure '^^xM&ja^6pj^^^^:U 
In^ffir^tiit^m^of criterion (1) is the least for a certain macroblock at the HR enhancement layer, tfns macroblock is 
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the low quality reference instead and the rnaximal error mcurred 
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J^^M0WAffS^i\ ti&e drifting error at 1^ currei^ larger than ihfe allowable q^iatity 

S^O^^j^v^i^ omerwse mac^ encoded 
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This section verifies the, proposed fran^work with hybrid spatial and fine-grain SNR scalabilities throu^ extensive 

ex^riments, The perfonnan^ frainework (denoted as the proposed scheme) is con^ared^^^^^ of 

^,M P ^ G 'f t ^ dona ^ spffid scalable coding sche^ (denoted as the traditional scheme). TT^ KffEG^ test 

^^^ s ^P"^ ^ ^Mguard are used m the experm^htsi The base layer is encoded with the MPEG^4 advanced 
simi^f>r<^ 

: F6T--^|ft^iiM^|^ encoded as I^frame, and^tr^^ 

enc6<di%i^^ 

vectors^are^ 

iat both the 1R enhancement layer and m^ 

o the available cui^ 

^ m [14j^ same base ;layiery; : when the users or devices 

d^ire a low resolutibn but higher qi^ty video, the LR ermancemeht^a^ can be" leamly o^smitted to the chent to 
improve the video? transition from LR video to HR 

video using the HR enhancement layer. : , 



experimental results of die ^Coastguard sequehoe are given in Fig>5. m experiment (a), the bit rate of the base layer 
is 32kbps. The bit rate of the enhancement layer is 80kbps in the ^ traditional schenfe. In other words, SB R (Switch Bit 
Rate) is 112kbps in experiment (a). In experiment (b), the bit rate of the base layer is 64kbps, and the bit rate of the 
enhancement layer in the traditional scheme is 128kbps. SBR is 192kbps in experiment (b). In these two experiments, 
tjie HR enhancement layer bitetream of the proposed scheme can be arbitrarily truncated according to the available 
channel bandwidth. For example, when the available channel bandwidth is 128kbps, if the bit rate of the base layer is 
64kbps, the HR enhancement layer bitstream is truncated to 64kbps. The results of the Foreman sequence are given in 
Fig. 6. The base layer bit rate in experiment (a) is 32Kbps, and the enhancement layer bit rate in traditional scheme is 
64Kbps. The base layer bit rate in (b) is 64Kbps, and the enhancement layer bit rate in traditional scheme is 64Kbps. As 
shown in Fig. 5 and Fig. 6, the traditional scheme only provides two choices: either low resolution or high resolution. 
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Fig. 6: Comparisons between the traditional spatial scalable coding (MPEG4) scheme and the proposed scheme. 



^ base layer bit rate, the proposed scheme and the traditional,^ the 
Hh^?i b ^4^^^ base layer bit rale to SfBR, the proposed tra^wpi^l^ send more bits at the HR 

e^hai^n^nt;^^ the- national scheme still h^to ^dj^^b^ layer 

b;tstn|am only. Therefore, jtfee decoded quality of the proposed scheme is si^ficandx be^te 

scheme. When the channel bandwidth is equal "to SBR, the traditional decoder starts to decode the emiancement i&yfo 
bitstream. The decoded quality of the traditional scheme is about 0.7dB higher at SBR : 'tnra't^ 
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SSISIS^ " Snali 31 and significant ouality improvement is obtained whS the 
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